Formal context reduction in deriving concept hierarchies from corpora using adaptive evolutionary clustering algorithm star
نویسندگان
چکیده
Abstract It is beneficial to automate the process of deriving concept hierarchies from corpora since a manual construction typically time-consuming and resource-intensive process. As such, overall learning encompasses set steps: parsing text into sentences, splitting sentences then tokenising it. After lemmatisation step, pairs are extracted using formal context analysis (FCA). However, there might be some uninteresting erroneous in context. Generating may lead process, so size reduction require remove uninterested pairs, taking less time extract lattice accordingly. In this premise, study aims propose two frameworks: (1) A framework review current corpus utilising (FCA); (2) decrease context’s ambiguity first an adaptive version evolutionary clustering algorithm (ECA*). Experiments conducted by applying 385 sample Wikipedia on frameworks examine reducing context, which leads yield hierarchy. The resulting evaluated standard one lattice-invariants. Accordingly, homomorphic between lattices preserves quality 89% contrast basic ones, reduced inherits structural relation one. ECA* examined against its four counterpart baseline algorithms (Fuzzy K-means, JBOS approach, AddIntent algorithm, FastAddExtent) measure execution random datasets with different densities (fill ratios). results show that performs faster than other mentioned competitive techniques fill ratios.
منابع مشابه
Learning Concept Hierarchies from Text Corpora using Formal Concept Analysis
We present a novel approach to the automatic acquisition of taxonomies or concept hierarchies from a text corpus. The approach is based on Formal Concept Analysis (FCA), a method mainly used for the analysis of data, i.e. for investigating and processing explicitly given information. We follow Harris’ distributional hypothesis and model the context of a certain term as a vector representing syn...
متن کاملDeriving Concept Hierarchies from Text by Smooth Formal Concept Analysis
We present a novel approach to the automatic acquisition of taxonomies or concept hierarchies from texts based on Formal Concept Analysis. Our approach is based on the assumption that verbs pose strong selectional restrictions on their arguments. The conceptual hierarchy is then built on the basis of the inclusion relations between the extensions of the selectional restrictions of all the verbs...
متن کاملClustering Concept Hierarchies from Text
Abstract We present a novel approach to learning taxonomies or concept hierarchies from text. The approach is based on Formal Concept Analysis, a method mainly used for the analysis of data, i.e. for investigating and processing explicitly given information. Our approach is based on the distributional hypothesis, i.e. that nouns or terms are similar to the extent to which they share contexts. F...
متن کاملApplication of modified balanced iterative reducing and clustering using hierarchies algorithm in parceling of brain performance using fMRI data
Introduction: Clustering of human brain is a very useful tool for diagnosis, treatment, and tracking of brain tumors. There are several methods in this category in order to do this. In this study, modified balanced iterative reducing and clustering using hierarchies (m-BIRCH) was introduced for brain activation clustering. This algorithm has an appropriate speed and good scalability in dealing ...
متن کاملLearning Concept Hierarchies from Text with a Guided Hierarchical Clustering Algorithm
We present an approach for the automatic induction of concept hierarchies from text collections. We propose a novel guided agglomerative hierarchical clustering algorithm exploiting a hypernym oracle to drive the clustering process. By inherently integrating the hypernym oracle into the clustering algorithm, we overcome two main problems of unsupervised clustering approaches relying on the dist...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Complex & Intelligent Systems
سال: 2021
ISSN: ['2198-6053', '2199-4536']
DOI: https://doi.org/10.1007/s40747-021-00422-w